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Abstract 

This paper investigates the hedging effectiveness of a dynamic moving window OLS hedging model, formed 
using wavelet decomposed time-series. The wavelet transform is applied to calculate the appropriate dynamic 
minimum-variance hedge ratio for various hedging horizons for a number of assets. The effectiveness of the 
dynamic multiscale hedging strategy is then tested, both in- and out-of-sample, using standard variance reduction 
and expanded to include a downside risk metric, the time horizon dependent Value-at-Risk. Measured using 
variance reduction, the effectiveness converges to one at longer scales, while a measure of VaR reduction indicates 
a portion of residual risk remains at all scales. Analysis of the hedge portfolio distributions indicate that this 
unhedged tail risk is related to excess portfolio kurtosis found at all scales. 

1. Introduction 

The use of derivative securities, in particular futures contracts, allows both producers and consumers to reduce 
potential future price risk associated with a given spot position. Much of the large body of literature written on the 
issue of futures hedging have focussed either on the empirical estimation of the optimal hedge ratio (OHR) or the 
derivation of the OHR using different objective functions. Many approaches to obtain optimal hedge ratios have 
been suggested, both static and dynamic. Static hedging techniques include minimum variance, mean-variance, 
mean-Gini and generalized semi-variance. Dynamic hedge ratios have also been proposed, applying techniques 
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such as GARCH or moving-window estimation to capture changes in the relationship between assetsjj However, 
few empirical studies consider the effect of the hedging horizon on the optimal hedge ratio, even though various 
hedging participants may have very different hedging horizons. Due to the sample reduction problem associated 
with matching the frequency of data with the hedging horizon, analysis of dynamic hedging at different time- 
horizons has been little studied. In this paper, we overcome this difficulty by combining wavelet multiscale 
analysis with a moving window OLS, to calculate the time and scale dependent covariance structure and hence 
determine the dynamic time horizon dependent hedge ratio. We then build further upon previous studies, by 
measuring the effectiveness at each time-horizon using a value-at-risk (VaR) measure, to assess the tail risk of the 
hedge portfolio at each scale. Finally, to try to understand better the changes in effectiveness at different scales, 
we expand upon previous studies and explore the distributional characteristics of portfolio returns at different 
horizons by determining the scale dependent moments including skewness and kurtosis. A number of implications 
for hedgers emerge from our findings. First, hedgers with a longer time horizon benefit from lower levels of 
risk, higher effectiveness and lower transaction costs. Second, static multiscale hedge ratios, found in previous 
studies, result in a smoothing of the data, which obscures the large dynamical changes that occur over time. A 
dynamic multiscale method is shown to be more appropriate, capturing features not apparent using a static method. 
Finally, while previous studies have demonstrated little hedge portfolio risk at longer scales, we find using a VaR 
effectiveness measure, that excess unhedged tail risk remains. This highlights the weakness of the minimum 
variance hedge, even at long time-horizons. 

The risk of financial asset s is uniquely shap e d by the time-horizon studie d . In the context of hedging, a l imited 



number of studies, incl uding 



(1992); 



Ederington(1979); 



Hill and Schneeweis ( 1982); 



Malliaris and Urrutia 



(1991); 



Benet 



Gepperj (11995b . have demonstrated an increase in hedging effectiven ess for longer horizons, by m a tching 



the fre quency of the data with the hedging horizon. However, out-of-sample. 



Malliaris and Urrutia ( 1991 ); 



( 1992) found a lack of stability in the hedging effectiveness for longer horizons. More recently, 



Benet 



Chen et al. 



(2004) 



demonstrated, using subsampled data, that both the hedge ratio and effectiveness tend to increase with the length of 



time horizon 



Th e effec tiveness of scaled short-term horizon data applied to longer-term horizons was studied by 



Cotter and Hanlv ( 2009), where scaled hedges were shown to provide good hedging effectiveness across a number 



of assets. In all of these studies, the returns were calculated by sub-sampling over different horizons resulting 



'Here, we follow the Chen et al. (2003) breakdown between static and dynamic hedging techniques. These alternative methodologies are 
reviewed here and references therein. 

2 In this article, we define subsampled data as returns calculated from price data of a longer horizon, found by subsampling the original 
asset prices. For example, one can create monthly returns from daily prices by subsampling the data every twenty days and calculating the 
return. However, this has the obvious effect of reducing the sample size available, something we attempt to overcome in this study by using a 
wavelet approach. 
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in reduced quantities of data for longer-term horizons. In order to overcome the sample reduction difficulties 
associated with reduced data quantity, we apply wavelet multiscaling techniques which allow us to compute the 
hedge ratio based on all data available at each scaled 

Using wavelet multiscale analysis, we compute the hedge ratio and study hedging effectiveness at different 
time horizons. Wavelets have previously been applied to a variety of economic and financial time series to de- 
compose the data into orthogonal time-scale components of varying granularitiesQ Recently, wavelet multiscaling 
techniques have been applied to test the dependence of the futures hedge ratio on the underlying time-scale struc- 
ture of the data. By calculatin g the wavelet variance and covariance at different scales for S&P 500 index and 



futures data. lln and Kiml (2006b) showed that there is a unique hedge ratio associated with each scale, which con- 
verges to one for longer scales. Further, using the level of variance reduction as a measure of hedging effectiveness, 
they demonstrated that the hedging effectiveness also converges to one. Similar resul ts were found betw een the 



Australian All Ordinaries Index and the Sydney Futures Exchange Share Price Index, din and Kiim 



A comparison of wavelet multiscale hedge ratios to other approaches has also been addressed, (ILien and Shrestha , 



2006a). 



2007). Comparison to the error-correction hedge ratio revealed an outperformance for short time-horizons, while 
for long time-horizons, the optimal multiscale wavelet ratio was found to dominate, with si milar result s foun d 
both in- and out-of-sample. The optimal hedge ratios for a portfolio of commodities was found. Fernand ez (2008), 



using copulas to measure the asset returns dependency and wavelets to account for hedging horizon. Improved 
hedging effectiveness was found for the portfolio of commodities compared to a single position, with additional 
benefits at longer scales. While these previous studies detailed the effects of time-horizon on the hedge ratio, 
the relationship between the cash and futures is assumed to be static. Assuming a static hedge ratio may restrict 
the introduction of newly available information which may impact the covariance structure and, hence, the hedge 
ratio. In order to incorporate the characteristic time varying covariance, we expand upon these previous studies 
through the development of a dynamic multiscale hedge ratio. 

It is well documented in the literature that th e relati o nship between asset retu rns i s time varying. In t his a rticle, 



we follow the methods of 



Malliaris and UrrutiaJ (119911) . Harris and Shen 



(2003J) and 



Cotter and Hanlv 



(2006) and 



use a rolling window OLS, in order to capture changes in the covariance structure over time. This method, 



3 It should be noted that these data points are based upon the same sample, which may result in a re duction of the prec ision at longer scales. 

4 Early applications included the study of foreign exc hange data using waveform dictionaries, (Ramsey and Zhang, 1997), the decom- 
position of economic relationships, I Ramsey and Lampart, 1998), scaling properties of volatility, (Gencav et ul., 2001b) and the relationship 
between systematic risk and return at different scales, (Gencay et at, 2003). More recently, the relationship be tween stock returns and inflation , 
)Kim and Irj|2005h . the co-skewness and co-kurtosis between equities and the market at various time-scales, (Galagedera and Maharai, 2008), 
the scale dependence of he dge fund market risk and correlation, 4Conlon et aU l2008) and international diversification benefits at different time 
horizons, iRua and Nunes, 2009) have been studied using the wavelet transform. 
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combined with wavelet multiscaling, allows us to measure the hedging effectiveness both in- and out-of-sample for 
different time-horizons, providing a simultaneous time-scale measurement of multiscale hedging effectiveness^ 
Further, the wavelet multiscaling technique used is not subject to downsampling, (or reduction of the number of 
coefficients at longer scales), thus allowing us to align the hedge ratio and effectiveness features at different scales 
for dynamic comparison. 

The performance of the hedging effectiveness at differing time-horizons is measured using two different meth- 
ods, variance and value-at-risk reduction. The variance method alone has been applied in previous wavelet mul- 
tiscale hedge ratio studies and measures the reduction in hedge portfolio variance, compared to the unhedged. 
However, variance assigns an equal weight to positive and negative returns, while a measure that differentiates 
between positive and negative returns may capture the hedger's preferences better. In order to study the effect 
of hedging on the nega t ive tail returns of the h e dge portfolio, w e also use value-at-risk (VaR) reduction, (see 



Cotter and Hanlv(2006) 



Han-is and Shen(2006); 



Cao et al 



(2009))O When returns are normally distributed with 



mean zero, the VaR is simply a multiple of the standard deviation of the portfolio. However, for non-normal 
returns, the VaR takes into account the higher moments of the distribution and so, improves upon the variance. 
The second issue addressed in this article is the measurement of the effectiveness, at different scales, using both a 
standard variance metric and a VaR measure to explore scale dependent tail risks^ 

While a number of studies have shown a reduction in portfolio variance at longer time-horizons, the effect 
of time scale on the skewness and kurtosis, and hence the tail risk of a hedge portfolio, has not been examined. 
The use of variance as a measure of risk is on ly correct when investors have a quadratic utility and returns are 



elliptically distributed, (Harris and Shen 



20061) . When these conditions do not hold, variance cannot characterize 



Harris and Shen (2006) found the 



fully the risks associated with higher moments of the returns^ Previously, 
skewness of the minimum-variance hedge portfolio to be little changed, while the portfolio kurtosis tended to 
increase compared to the unhedged asset, using original daily returns data. Thus, the final issue addressed in this 
article is the effect of time-scale on the portfolio skewness and kurtosis and hence the risk of the hedged futures 
portfolio. 

This paper is organized as follows. In Section|2] we describe the application of wavelets to decompose returns 



5 Alternative approaches, not pursued here, to capture the time- varying covariance structure include GARCH models, (Chen et all l2003l) . 

Alternatively, we could examine hedging for positive tail returns but, for conciseness, we illustrate the use of VaR as a performance 
evaluation method for a single side of the distribution 

7 The semivariance was also tested as a measure of hedging effectiveness at each scale. However, both in- and out-of-sample the effective- 
ness was found to be very similar to that of th e variance measure. These results are available on request. 

8 See Christie-David and Chaudhrv (2001) where they demonstrate the importance of skewness and kurtosis in explaining the return- 
generating process of futures. 
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into component scales and then describe the optimal hedge ratio and hedging effectiveness measures. Data and 
empirical results are described in Section [3] while some concluding remarks are given in Section 2] 



2 Methodology 



2.1 Wavelet Multiscale Analysis 

We provide a short syn o psis of wavelet multiscale an alysis relevant to this study, (for more comprehensive detail, 



see 



Burrus et al. 



(1997); 



Percival and Walden (2000)). The discrete wavelet transform provides an efficient means 
of studying multiresolution properties, as it can be used to decompose a signal into different time horizons or 
frequency components. There are two basic wavelet functions, the father wavelet (f> and mother wavelet rp, which 
can be scaled and translated to form a basis for the Hilbert space L 2 ($i) of square integrable functions. The father 
and mother wavelets are formally defined by the functions: 



Jifc (t) = 2-ict>(2H-k) 



(1) 

(2) 



where j = 1, ... J is the scaling parameter in a J-level decomposition and k is a translation parameter. The long 
scale trend of the time series is captured by the father wavelet, which integrates to 1, while the mother wavelet, 
which integrates to 0, describes fluctuations from the trend. The wavelet representation of a discrete signal f(t) in 
L 2 (5R) is given by: 



/(*) = £«J,*fok(*)+£ 

dj,k4>J,k (t) + ... + 5^di,k&,fc(t) 



(3) 



where k ranges from 1 to the number of coefficients in the specified level and J is the number of multiresolution 
levels, (scales). Smooth and detail component coefficients, sj t k and dj t k, are found by integrating over time, dt, 



•"•././. = / (j>j,kf(t)dt 
dj,k = I il>j,kf(t)dt {j = 1, ... J) 



(4) 
(5) 
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Each coefficient sets sj,dj, dj-i, . . . di is called a crystal, where coefficients from level j = 1 . . . J are associ- 
ated with scale [2 j ~ 1 ,2 : >]. 



2.1.1 MODWT 



In order to overcome some of the difficulties associated with the DWT, in this paper we adopt the maximum overlap 
discrete wavelet transform (MODWT), a highly redundant linear filter that transforms a se ries into coefficients 



related to variations over a set of scales, (IPercival an d Walden 



2000; 



Gencav et al. 



2001a). The MODWT has 



several advantages over the DWT, allowing alignment of wavelet scaling and detail coefficients with the original 
time-series. The MODWT can also handle any sample size N, whereas the DWT restricts the sample size to a 
multiple of 2 3 . Here, we apply the MODWT as it allows us to explore any sample size, align the coefficients with 
the original data and calculate the wavelet variance and covariance effectively at different scales. 

Like the DWT, the MOWDT produces a set of time-dependent wavelet and scaling coefficients with basis 
vectors associated with a location t and scale Tj — [2 J_1 , 2 3 ] for each decomposition level j = 1, . . . , Jq. How- 
ever, the MODWT is nonorthogonal and has a high level of redundancy, retaining downsampled values at each 
level of the decomposition that would be discarded by the DWTO Decomposing a signal using the MODWT to J 
levels theoretically involves the application of J pairs of filters. The filtering operation at the j th level consists of 
applying a rescaled father wavelet to yield a set of detail coefficients 



L<-1 



t-i 



(6) 



and a rescaled mother wavelet to yield a set of scaling coefficients 



Li-l 



1-1 



(7) 



1=0 



for all times t = ...,—1,0,1,..., where / is the function to be decomposed, (IPercival and Wa lden. 



2000). The 



rescaled mother, t/^j = and father, (f>j it = wavelets for the j th level are a set of scale-dependent 
localized differencing and averaging operators and can be regarded as rescaled versions of the originals. The j th 
level equivalent filter coefficients have a width Lj = (2 J — 1)(L — 1) + 1, where L is the width of the j = 1 base 
filter. In practice, the filters for j > 1 are not explicitly constructed because the detail and scaling coefficients 



'Downsampling or decimation of the wavelet coefficients retains half of the number of coefficients that were retained at the previous scale 
and is applied in the Discrete Wavelet Transform. By retaining all coefficients at each scale, the MODWT has 'redundant' coefficients or 
coefficients not necessary to recreate the original signal. This, however, results in significant benefits for multiscale analysis, described above. 
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can be calculated, using an algorithm that involves the j = 1 filters op erating recurrently on the j level scaling 
coefficients, to generate the j + 1 level scaling and detail coefficients, (IPercival and WaldenL 2000). 



2.1.2 Wavelet Moments and Covariance 



The wavelet variance Variance f (jj ) at scale j is defined as the expected value of D 2 t if we consider only the 



non-boundary coefficients L_] An unbiased estimator of the wavelet variance for function f(t) at scale j is formed 
by removing all coefficients that are affected by boundary conditions and given by: 



1 JV ~ 1 

Variance/ fa) = ^22 

3 t=Li-l 



(8) 



where Mj = N — ■. + 1 is the num ber of non-boundary coefficients at the j level associated with the time 



horizon r, iPercival and Waldenl (|2000). The wavelet variance decomposes the variance of a process on a scale- 
by-scale basis (at increasingly higher resolutions of the signal) and allows us to explore how a signal behaves at 
different time horizons. 

Similarly, the wavelet skewness and kurtosis can be defined on a scale-by-scale basis. Assuming that the 
wavelet coefficients Djj at each scale have zero mean, the unbiased skewness at each scale is given by 



Skewness /fa) 



(9) 



while the unbiased kurtosis at each scale is 



Kurtosis f(rj) 



Mi S 



N—l 

Mi l~*t=Li- 



n 4 

i U 3,t 



(10) 



S,t) 



with o- 2 t(Tj) = Variance /(tj) the standard deviation of the wavelet coe fficients at scale j. Similarly, fo rmulas 



for the co-skewness and co-kurtosis at different scales have been derived, (Galagedera and Maharai 



2008). 



In our analysis, we use (0 and (fTUl l. to examine the higher moments of the hedge portfolio, in order to gain 
insight into the distributional effects of scaling. As described in lHarris and Shenl (120061) . the skewness and kurtosis 
of the minimum variance hedge portfolio can be larger than that of the individual assets, creating a need for a 
measure of risk that captures these higher moments. Here, we also address the question of how time horizon 
effects the higher moments of the hedge portfolio and hence, the tail risk of the hedge portfolio. 



10 The MODWT treats the time -series as if it were periodic using "circular boundary conditions". There are Lj wavelet and scaling coeffi- 
cients that are influenced by the extension, which are referred to as the boundary coefficients. 



The wavelet covariance between functions f(t) and g(t) is defined, similar to (©, to be the covariance of the 
wavelet coefficients at a given scale. The unbiased estimator of wavelet covariance at the j th scale is given by 



N-l 

Covariance^) = — £ f^fl^f (11) 

1 i=L,-l 



where all wavelet coefficients affected by the boundary are removed and Mj = N—Lj+1, (see 



Percival and Walden 



(2000) for a complete treatment of wavelet moments). 



2.2 Minimum Variance Hedge 

In this paper, we use the wavelet transform so as to calculate the minimum variance hedge ratio at different time 
horizons. For an individual holding a spot position in some asset, hedging involves taking an opposite position in 
the futures market. Assuming a long position in the spot market, the return on a hedge portfolio is given by 

r t = s t - hf t (12) 

where f t and s t are the log returns of the futures and spot markets at time t and h is the hedge ratio. The risk of a 
portfolio, commonly given as the variance in returns is 

Var(r t ) = Var(s t — hf t ) 

= Var(s t ) + h 2 Var(f t )-2hCov{s t J t ) (13) 

The static minimum variance hedge ratio is the value of h that minimizes ( TT3l >. and given by 



h = CoviancejsJ) 
Variance(f) 



where Covariance(sf) is the covariance between the spot and futures returns and Variance(f) the variance of 
the futures returns. However, the time variation of the variance-covariance ma trix is a well known featu re of many 



financial asset returns leading to an optimal time-dependent hedge ratio, h t , ([Kroner and Sultan , 



1993), 



_ Coviance(s t , ft) ^ 
Variance(ft) 
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In order to account for this time-variation in the variance- covariance matrix, a rolling window OLS approach is 
used, with all observations given an equal weighting. This approach combines well with the wavelet transform, 
allowing a dynamic scale dependent analysis of the hedge ratio. To calculate the hedge ratio at each scale, we 
simply replace the variance and covariance in (lT5t by that found using the wavelet coefficients at each scale for 
each moving window, ([8) and ( fTTT i. 

2.3 Hedging Effectiveness 

We examine both in-sample and out-of-sample hedging performance using two different performance metrics, 
variance reduction, which incorporates both upside and downside risk, and value-at-risk reduction which captures 
risk for one side of the distribution in our case. 

Variance reduction measures the percentage reduction in the variance of a hedge portfolio compared to the 
unhedged spot position and is given by 



_ Variance(r t ) 

^variance ■ / \ 

V ariance(St) 



where Variance(rt) and Variance(st) are variance of the returns for the hedge portfolio and spot respectively. 

To study the effect of hedging on negative tail returns and measure risks posed by higher moments of portfolio 
returns, we use Value-at-Risk ( VaR), which e s timates the maximum p ortfolio expected loss for a given confidence 



level over a given time period, (Jorion 



(2006) 



Harris and Shen 



(2006)). The VaR at confidence level a is 



VaR a = q a (17) 

where q a is the relevant quantile of the loss distribution. The effectiveness from the point of view of Value-at-Risk 
reduction is then measured by 



„ w _-, VaRM 

tih VaR - 1 - , r (18) 

VaR a (s t ) 



with VaR a (r t ) and VaR a (s t ) the value-at-risk of the hedge portfolio and spot, at confidence level a. 
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3 Empirical Analysis 



3.1 Data 

For the empirical analysis, we choose examples of three asset classes. We dynamically hedge long spot exposures 
in West Texas Intermediate (WTI) Crude Oil, the S&P 500 Equity Index and the GBP '/U SD currency exchange 
rate. These assets were chosen to represent a diverse set of highly liquid cash and futures markets, where a long 
returns history for both the spot and futures markets was available^ Each long spot position is hedged by taking 
a short position in the corresponding futures contract. The empirical results are found using daily returns data for 
the period 02 Jan 1986 to 31 December 2009: 

1 . For WTI Crude Oil, the corresponding futures contract is the New York Mercantile Exchange (NYMEX) 
contract, giving a total of 6259 trading days. 

2. The S&P 500 data consists of 6238 daily traded returns, with the futures contract traded on the Chicago 
Mercantile Exchange. 

3. The British Pound to US Dollar exchange rate, with a total of 6261 daily returns and the futures contract is 
traded on the Chicago Mercantile Exchange. 

The timeframe examined was chosen as it covered a large number of different adverse events for each asset, 
allowing a detailed study of the effects of scaling on hedging effectiveness, during both normal and turbulent 
markets. Data was obtained from Datastream, using closing prices for the spot index and the corresponding daily 
settlement price for the futures contract. Each futures contract studied is nearest-to-maturity and rolled over to the 
next contract on the first day of the contract month. 

As outlined in Section 12.1.11 we decompose both the cash and futures returns by employing t he MODWT. 



For th e present study, we selected the least asymmetric (LA) wavelet, (known as the Symmlet, (IBurrus et al. 



1997)), chosen as it exhibits near symmetry about the filter midpoint and has the property of aligning the wavelet 



coefficients accurately with the unfiltered time seriesPjj LA filters are defined in even widths and the optimal filter 
width is dependent on the characteristics of the signal and the length of the data series. The filter width chosen 
for this study was the LA8, (where 8 refers to the width of the scaling function). The length of the rolling window 

u 

used in the analysis is 1000 dayo and we chose scale 6, corresponding to 32 — 64 day dynamics, as the largest 



"Additional assets, (eg. Gold), were also studied and the results found to be consistent. These are not presented for conciseness. 
12 The Daubauchies D4 and the Coifiet C10 wavelets were also studied but resulted in little qualitative difference to the analysis. 
13 Different window sizes were also studied, with longer windows found to smooth changes in the hedge ratio, while shorter windows 
resulted in more volatile ratios. However, the main results found in this paper were qualitatively the same regardless of the window studied. 
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decomposition level, to strike a balance between the maximum scale and the number of boundary coefficients. As 
described in Section |2~T1 the scales studied can be interpreted as follows: Scale 1 — > 1 — 2 day, Scale 2^2 — 4 
day, Scale 3 -> 4 - 8 day, Scale 4 -> 8 - 16 day, Scale 5 -> 16 - 32 day and Scale 6 -4 32 - 64 day dynamics. 

Summary statistics, including the mean, standard deviation, skewness, and kurtosis for each of the assets at 
each scale can be found in Table Q] Starting with the original returns data, we find the common stylized features 
of financial returns, namely, excess kurtosis and a lack of normality for both spot and futures. Turning to the scale 
statistics, as described in Section l2~Tl the mother wavelet integrates to zero, and so the mean value for the wavelet 
decomposed data at each scale is zero. As found in previous studies, the standard deviation decreases at lower 
scales, with total variance conserved. The skewness is found to be predominantly negative across assets and scales, 
while excess kurtosis is found at all scales. However, the level of kurtosis is found to decrease significantly at long 
scales. The hypothesis of normality for the returns coefficients associated with each scale is found, however, to be 
rejected by the Jacque-Bera statistic for all assets. 

[Table 1 about here.] 

3.2 Dynamic Hedging With Subsampled Data 

In this paper, we use wavelet multiscaling techniques to study the dynamic scale dependent hedge ratio, in order 
to overcome the sample reduction problems associated with sub-sampling data. To demonstrate the sub-sampling 
problems in the case of Crude Oil, we first study the changes in the optimal dynamic hedge ratio (and associated 
hedging effectiveness) for a number of sub-sampled time-horizons Q This is achieved by calculating asset returns 
from price data sampled every 3, 6 and 12 days, reducing the quantities of data available for analysis. This ignores 
any information contained in the unused data, a problem that is overcome using wavelet multiscaling. To allow 
for comparison with the rolling window wavelet techniques later, we estimate the returns in a rolling window of 
200 days, calculate the hedge ratio and then move forward one time-period, (dropping the first observation)!^ 

The results, averaged over each moving window for the time-horizons described are shown in Table As 
expected, an increase in the hedge ratio is found at longer time-horizons, with a corresponding increase in the 
hedging effectiveness. The effects on skewness and kurtosis are more difficult to determine, with differing trends 
across scales, although the kurtos is using 3, 6 and 12 d ay returns is lower than for the original daily returns. 



However, as previously shown by 



Harris and Shen 



(2006) for cross-hedged currency portfolios, the kurtosis of the 



14 Results were found to be consistent for the other assets studied but are not reported for brevity. 
15 Longer windows reduced the available data further and would impair a detailed rolling window analysis. 
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hedge portfolio was found to be greater than that of the spot for all returns horizons. 

The difficulties of using sub-sampled data can be seen more clearly in Figure Q] where in-sample hedging 
effectiveness, measured using variance reduction, for each time-window at each hedging horizon is shown. The 
dynamic nature of hedging effectiveness is clearly visible, with quite dramatic variations using the original data. 
The increased effectiveness at longer horizons is visible; however the reduction in sample data makes it difficult 
to match and compare features for different scales at any given point. Further, the reduced quantities of available 
data prevents analysis at scales much longer than 12 days as the statistical quality of the data deteriorates quickly, 
while also preventing a sensible out-of-sample analysis. To attempt to overcome these difficulties, we apply 
wavelet decomposition. 

[Table 2 about here.] 
[Figure 1 about here.] 

3.3 Dynamic Scale Dependent Hedging 

In order to investigate the effects of both time and scale on the hedge ratio and hence the hedging effectiveness, we 
calculate the optimal dynamic hedge ratio using a moving-window technique. This was implemented as follows: 
The wavelet coefficients for both spot and futures returns were calculated, up to the sixth scale, using a moving 
window of 1000 days, allowing the variance, ([8j, covariance, (fTTT i. and minimum variance hedge ratio, ([I5t . to 
be found for each scale in each window. The in-sample hedging effectiveness at each scale was determined using 
the wavelet coefficients from the first 1000 days. The out-of-sample effectiveness at each scale was measured 
by applying the in- sample hedge ratio to the w avelet coefficients calculate d over the next 1000 days , (following 



the an alysis of 



Benetl ( ll992l) : 



Chen et al. 



Lien and Shrestha 



(2007); 



Fernandez 



(2004) using subsampled data and 
(2008) using wavelet filtered data). Thereafter, the observation at T + 1 is incorporated into the data and the first 
observation excluded, with the above process repeated. 

To illustrate, the dynamic hedge ratio for Crude Oil is shown, Figure [2] with the dynamic minimum variance 
hedge ratio for the original data shown in the upper plot, while those found at wavelet scales 1, 3 and 5 are shown 
in the lower plotsQ We find that the hedge ratio tends to one as we move to longer time scales. However, we 
find that the ratio is far from static, in particular at short time-scales. The dynamics of the ratio at scale 1,(1 — 2 
days), has a trend similar to that of the original data, although the value of the ratio is reduced somewhat. By 



16 For brevity, the plots of the moving window analysis are not shown for the other assets. However, summary statistics are shown in Table 
[4]and Table[5] while the plots are available from the authors upon request. 
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scale 5, the ratio has converged to one; however there is evidence of some spikes in the data, which may be a 



result of considerable basis risk at that pointrj The less dynamic nature of the hedge ratio at longer scales would 
have a major impact on the level of transaction costs involved for hedgers with a long term horizon. For a hedger 
with a horizon of 16 — 32 days and above (scale 5 and above), the convergence of the hedge ratio to one means 
a consistent hedge, reducing the considerable transaction costs associated with the changes in the hedge ratio at 
shorter horizons. 

[Figure 2 about here.] 

The hedging effectiveness, measured using variance reduction, dT6b . for each moving window is shown in 
Figure|U with both the in- and out-of-sample effectiveness overlaid for comparative purposes. There are a number 
of interesting points to note here. First, the in-sample and out-of-sample effectiveness tend to be very close at 
all scales, indicating favourable perf ormance of the dynamic multiscale hedging. This is in contrast to the results 



found bv lLien and Shresthal (12007). where the out-of-sample hedging effectiveness for Crude Oil was shown to 
improve relative to the in-sample at longer scales, only for a single window. Second, moving to longer time scales, 
the variance reduction effectiveness (both in- and out-of-sample) increases significantly and by scale 6, (32 — 64 
days), the effectiveness has, on average converged to one, (see Tabled. However, it must be noted that there occurs 
a number of singularities where the hedging effectiveness drops considerably at long scales. Singularities such as 
these were not witnessed in previous multiscale hedging studies, as they examined the multiscaled properties of the 
entire dataset, effectively smoothly out these events but ignoring the dynamic nature of variance and covariance. 
These large drops in effectiveness occur at days with large basis risk, demonstrating that even with a long hedging 
horizon, a hedger may be subject to basis risk. 

[Figure 3 about here.] 

We now consider the effect of the minimum-variance hedge ratio on the 95% value-at-risk of the hedge port- 
folio in Figure [4] Similar to the analysis for variance reduction, we find that in- and out-of-sample effectiveness 
track each other closely. Also, the hedging effectiveness was found to increase at longer scales. However, at scale 
5 (16 — 32 days), the average VaR hedging effectiveness was 88%, compared to 98% for the variance reduction 
measure, resulting from residual unhedged tail risk. In fact, across all scales, VaR effectiveness was found to be 
weaker compared to the variance reduction measure. This reduced effectiveness is due to the use of the minimum 



17 For example, on 28th January 1991, coinciding with the end of the First Gulf War, spot Crude Oil fell in price by 6.0%, while the futures 
fell by only 0.8%. 
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variance hedge, which considers only the second moment of the returns distribution. As shown in Table Q] asset 
returns at all scales are non-normal, resulting from fat tails of the returns distributions. To determine if higher 
moments are the cause of the differences between effectiveness measures, we consider the skewness and kurtosis 
of both the hedged and unhedged portfolios at each scale. 

[Figure 4 about here.] 

Table [3] displays summary statistics across windows and shows the average hedge ratio, hedging effectiveness 
(95% VaR and variance reduction), and the standard deviation, skewness and kurtosis of both the unhedged asset 
and the hedge portfolio at each scale for WTI Crude OilQ These results are shown both in- and out-of-sample 
and, in both cases, we find that the standard deviation is reduced for both the unhedged and hedged portfolios at 
longer scales. The skewness although predominantly negative, is more ambiguous across scales, with no distinct 
trend found. However, considering the k u rtosis , we find it is greater for the hedge portfolio across all scales, 



similar to that found by 



Harris and Shen 



(2006) for daily returns data. Also, for both hedged and unhedged 
portfolios, the level of kurtosis drops consistently as we move to longer scales. However, even at the longest scale 
studied, we find excess kurtosis for the hedge portfolio, (while the unhedged portfolio has zero excess kurtosis) 



This is in keeping with the findings of 



lie 





Harris and Shen (2006) using daily unfiltered data and this may indicate 



that the excess kurtosis is behind the reduced effectiveness of the hedge portfolio from a value at risk perspective, 
(compared with the variance reduction me asure). This indicates that a technique that explicitly accounts for higher 



moments, such as VaR minimisation, see 



Harris and Shen (2006); 



Cao et al. 



( 2009), may be more appropriate in 



reducing the risk, even at longer time horizons. 



[Table 3 about here.] 



Results, averaged across each moving window, for a minimum variance hedge portfolio consisting of a long 
position in the S&P 500 hedged with a short index futures position, are shown in Table|4]for different time scales. 
Similar to that seen for Crude Oil, both the hedge ratio and the hedging effectiveness tend to increase at longer 
scales, (both in- and out-of-sample), although the VaR effectiveness tends to be lower than that measured by vari- 
ance reduction. Comparing the in-sample and out-of-sample results, we find that the dynamic minimum variance 
hedge ratio has slightly better in-sample effectiveness, although the small differences indicate the robustness of 



18 The averages were found across each overlaid moving window using only data where both in- and out-of-sample results were available 
concurrently. This allows a direct comp arison between the in- and out-of -sample results. 

"This is in contrast to the findings of Galagedera and Maharai (2008), where the MODWT was used to demonstrate the excess kurtosis of 
a portfolio of equities to be, on average, positive at short scales while consistently negative at longer scales. 
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the method. Examining the skewness of the returns for both the unhedged and hedged portfolio, again no distinct 
trend emerges across scales, although the hedged portfolio is found to always have negative skewness, in contrast 
to the unhedged asset. Finally, analysing the level of kurtosis, we find limited situations where the kurtosis of 
the hedge portfolio is less than that of the unhedged portfolio, (at scales 1 and 6), something not witnessed for 
the other assets studied. For the unhedged asset we find the kurtosis at scale one to be greater than that using the 
unfiltered original data, indicating that this ti me-scale pick s up some large tail risks, (or large amplitude noise), 



not common to other scales. As described by 



Benet ( 1992), this may indicate presence of a large amount of price 



uncertainty in this market for short time scales, while at longer timescales more information reduces the amount 
of uncertainty and hence the level of basis risk. However, similar to the other assets we find kurtosis decreases 
at longer scales for both the unhedged and hedged assets. As the VaR hedging effectiveness is only 0.87 at the 
longest scale, this suggests that higher order moments may also have an influence on the tail risk of the portfolio, 
(again, a VaR minimisation technique might help in reducing or eliminating these risks). 

[Table 4 about here.] 

The final dataset examined is a British Pound/US Dollar spot position hedged using futures, with results 
averaged over each 1000 day moving window, shown in Table[5] As found previously for other assets, the hedging 
effectiveness was similar in- and out-of-sample across all scales, indicating the robustness of the dynamic method 
for futures hedging. Similar to previous assets, we also find that the hedge ratio and effectiveness at the first scale, 
(1 — 2 day horizon), is substantially reduced compared to the other scales and to the original data. This suggests 
that there is more market uncertainty at short scale s or in a statis tical sense that there is greater noise, increasing the 
difficulty of measuring the hedge ratio accurately, Benetl (119921) . However, by the second scale (2 — 4 day horizon) 



the effectiveness is greater than that found using the original data, and then increases to a maximum of 0.98 
(variance reduction) or 0.88 (VaR reduction) at long scales. Examining the skewness, we find that the unhedged 
portfolio skewness turns negative at the fourth scale, while for the hedge portfolio it remains positive for the 
majority of scales. Finally, the kurtosis of the hedge portfolio is found to be greater than the unhedged across all 
scales, with both decreasing at larger scales. However, even at th e largest scale, excess ku rtosis persists suggesting 



that variance minimisation may not eliminate all portfolio risk, (Har ris and Shen 



2006). This is substantiated by 



the residual portfolio tail risk found using VaR effectiveness, even at long scales. 



[Table 5 about here.] 
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Common to all the data sets studied is a reduction in the standard deviation of the dynamic hedge ratio at 
longer scales. Compared to an agent with a short time-horizon, one with a longer horizon can substantially reduce 
the transaction costs involved, further enhancing the benefits of long-horizon hedging. This results from the 
convergence of dynamic hedging to an almost static hedging strategy at long horizons producing, as demonstrated, 
improved levels of risk management. VaR hedging effectiveness was found, for all assets across scales, to be less 
than that measured using variance reduction. Similarly, the kurtosis was found to be larger for the hedge portfolio 
at all scales, while both the hedge and unhedged kurtosis decrease at longer scales. By incorporating dynamic 
changes in covariance, we have demonstrated the benefits of a time varying multiscale hedge ratio, compared to 
the static multiscale approach considered in previous studies. 

4 Conclusions 

In this study, we apply the wavelet transform to investigate multiscale properties of both the hedge ratio and effec- 
tiveness of a futures hedge in a dynamic framework. We extend previous work by combining a moving-window 
OLS with wavelet decomposition in order to examine the time-scale behaviour dynamically. By calculating the 
minimum variance optimal hedge ratio at different wavelet scales in each window, we examine both the in- and 
out-of-sample effectiveness. Studying the results over the different moving-windows, we demonstrate the effec- 
tiveness of the dynamic method through the close tracking of the in- and out-of-sample hedging effectiveness 
at all scales. The scale dependence of the hedge ratio and the convergence to one for longer time-horizons are 
also shown, with a reduction in the standard deviation of the hedge ratio at longer scales, (leading to reduced 
transaction costs for a hedger with long horizon). 

Hedging effectiveness is measured first by calculating the fraction of the unhedged portfolio variance removed 
by hedging. However, the variance measures only the second moment of the returns distribution and may not 
capture rare negative tail returns. To test the hedging performance in the negative tail of the returns, we also 
measure the fraction of the 95% Value-at-Risk of the unhedged portfolio removed by hedging. For both measures 
of hedging performance, the effectiveness is found to increase for longer time-horizons both in- and out-of-sample, 
with the variance reduction measure converging to one for all assets. However, measured using Value-at-Risk, the 
effectiveness, although increasing does not converge to one at the longest horizons studied. Thus, the application 
of variance minimisation to find the optimal hedge ratio, minimizes the portfolio variance but ignores higher 
moments, resulting in excess residual tail risk for the hedge portfolio even at long time horizons. 
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To investigate further the effects of minimum variance hedging at different scales, we examine returns distribu- 
tion at all scales. The skewness of hedge portfolio returns has little consistency across assets or scales. However, 
the kurtosis for both the hedged and unhedged portfolios decreases as the hedging horizon increases, reducing the 
levels of tail risk, (as evidenced by the improvement in the VaR effectiveness measure). The portfolio kurtosis is, 
on average, greater than the unhedged asset. For both Crude Oil and British Pound/US dollar hedges, the hedge 
portfolio has excess kurtosis at all scales perhaps contributing to the extra tail risk found using VaR effectiveness. 

The implications of our findings are as follows: Hedgers with a longer time horizon benefit from lower levels 
of risk, higher effectiveness and lower transaction costs. The static multiscale hedge ratios, found in previous 
studies, result in a smoothing of the data, which obscures the large dynamical changes that occur over time. A 
dynamic multiscale method is shown to be more appropriate, capturing features not apparent using a static method. 
Additionally, while previous studies have demonstrated little hedge portfolio risk at longer scales, we find using 
a VaR measure, that excess unhedged tail risk remains. This highlights the weakness of the minimum variance 
hedge, even at long time-horizons. 
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Figure 1 : Dynamic hedging effectiveness 

Dynamic variance reduction hedging effectiveness is presented, in-sample, for a long Crude Oil position hedged 
with futures, for sub-sampled returns data with various time horizons, calculated using a rolling window of 200 
days. The effectiveness improves at longer time horizons, however the number of time periods available for 
analysis decreases considerably impairing comparison between different horizons. 
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Figure 2: Crude Oil: Dynamic multiscale hedge ratio. 

Notes: The dynamic minimum variance hedge ratio for Crude Oil was found using a rolling window of 1000 days. The static hedge ratio, calculated using all available data is 
also shown. At short scales, the dynamic hedge ratio is found to vary considerably over time, while at longer scales, although there are a number of spikes, the ratio converges to 
one. 
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Figure 3: Crude Oil: Dynamic multiscale hedging effectiveness. 

Notes: Hedging effectiveness, measured in terms of variance reduction, was found using a rolling window of 1000 days with in- and out-of-sample results overlaid. In- and 
out-of-sample results are found to track closely at all scales, while the effectiveness is found to increase at longer scales, with less variation. 
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Figure 4: Crude Oil: Dynamic multiscale value-at-risk hedging effectiveness 

Notes: Hedging effectiveness, measured using VaR reduction, was calculated using a rolling window of 1000 days with in- and out-of-sample results overlaid. In- and out-of- 
sample results are found to track closely at all scales, while the effectiveness is found to increase at longer scales. However, compared to the variance reduction effectiveness 
measure, (Figure |3}, some residual levels of risk remain even at the longest scale studied. 
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Table 1: Descriptive statistics for log returns of Futures and Spot Series at different time-scales for Crude Oil, S&P 500 Equity Index and GBP/USD Exchange Rate. 

Notes: The mean and standard deviation of each series is given in percentage terms, while a skewness of zero indicates no skewness and a kurtosis of 3 indicates no excess 
kurtosis. The Jacque-Bera statistic tests the null hypothesis that the distribution is normal and this hypothesis is rejected for all assets at all scales. 
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Table 2: Hedge portfolio summary statistics, consisting of a long position in Crude Oil, hedged using futures. 

Notes: The hedge ratio was calculated using data sub-sampled at various time-horizons. Shown are the hedge ratio and in-sample variance reduction hedging effectiveness, along 
with the standard deviation (in %), skewness and kurtosis for hedge portfolio returns data, averaged over each available moving window. 
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9.79 


Scale 5 


1.00 


0.98 


0.88 


0.30 


0.00 


-0.17 


-0.09 


3.15 


7.64 


Scale 6 


1.00 


0.99 


0.94 


0.30 


0.99 


-0.17 


-0.10 


2.86 


6.57 



Table 3: Statistics for Crude Oil unhedged and hedged portfolios at different scales 

Notes: Shown are the in- and out-of-sample hedge ratio, hedging effectiveness, standard deviation (in %), skewness and kurtosis, averaged over each moving window for the 
unhedged and minimum-variance hedged portfolio at different scales. 



In-Sample Hedging Effectiveness Standard Deviation Skewness Kurtosis 





Hedge Ratio 


Variance 


95% VaR HE 


Unhedged 


Hedged 


Unhedged 


Hedged 


Unhedged 


Hedged 


Original Data 


0.89 


0.93 


0.74 


1.00 


0.30 


-0.17 


-0.70 


6.23 


7.32 


Scale 1 


0.86 


0.90 


0.68 


0.70 


0.20 


0.05 


-0.16 


6.47 


5.33 


Scale 2 


0.90 




yj./o 


0.50 


0.10 


0.00 


-0.27 


4.34 


6.18 


Scale 3 


0.94 


0.98 


0.85 


0.40 


0.10 


-0.06 


-0.38 


4.54 


6.56 


Scale 4 


0.96 


0.98 


0.85 


0.20 


0.0 


-0.03 


-0.60 


4.03 


5.46 


Scale 5 


0.98 


0.99 


0.86 


0.20 


0.0 


0.17 


-0.75 


3.76 


4.28 


Scale 6 


0.97 


0.98 


0.87 


0.10 


0.0 


-0.04 


-0.03 


3.38 


3.17 


Out-of-Sample 


Hedging 


Effectiveness 


Standard Deviation 


Skewness 


Kurtosis 




Hedge Ratio 


Variance 


95% VaR HE 


Unhedged 


Hedged 


Unhedged 


Hedged 


Unhedged 


Hedged 


Original Data 


0.89 


0.92 


0.74 


1.00 


0.30 


-0.17 


-0.58 


6.23 


6.65 


Scale 1 


0.86 


0.89 


0.66 


0.70 


0.20 


0.05 


-0.11 


6.47 


4.94 


Scale 2 


0.90 


0.94 


0.76 


0.50 


0.10 


0.00 


-0.25 


4.34 


5.56 


Scale 3 


0.94 


0.97 


0.83 


0.40 


0.10 


-0.06 


-0.29 


4.54 


6.31 


Scale 4 


0.96 


0.98 


0.84 


0.20 


0.0 


-0.03 


-0.49 


4.03 


5.02 


Scale 5 


0.98 


0.98 


0.86 


0.20 


0.0 


0.17 


-0.64 


3.76 


4.16 


Scale 6 


0.97 


0.98 


0.87 


0.10 


0.0 


-0.04 


-0.03 


3.38 


2.97 



Table 4: Statistics for S&P 500 Equity Index unhedged and hedged portfolios at different scales 

Notes: Shown are the in- and out-of-sample hedge ratio, hedging effectiveness, standard deviation (in %), skewness and kurtosis, averaged over each moving window for the 
unhedged and minimum-variance hedged portfolio at different scales. 



In-Sample 


Hedging 


Effectiveness 


Standard Deviation 


Skewness 


Kurtosis 




Hedge Ratio 


Variance 


95% VaR HE 


Unhedged 


Hedged 


Unhedged 


Hedged 


Unhedged 


Hedged 


Original Data 


0.70 


0.57 


0.37 


0.50 


0.30 


0.00 


0.10 


4.83 


5.97 


Scale 1 


0.55 


0.37 


0.24 


0.40 


0.30 


0.08 


0.05 


3.93 


4.69 


Scale 2 


0.79 


0.69 


0.45 


0.30 


0.10 


0.03 


0.03 


4.17 


5.67 


Scale 3 


0.91 


0.88 


0.66 


0.20 


0.10 


0.01 


0.01 


3.72 


5.81 


Scale 4 


0.96 


0.96 


0.81 


0.10 


0.0 


-0.10 


0.20 


4.55 


5.53 


Scale 5 


0.96 


0.98 


0.88 


0.10 


0.0 


-0.12 


0.45 


3.48 


4.53 


Scale 6 


0.99 


0.98 


0.88 


0.10 


0.0 


-0.07 


0.00 


2.89 


4.02 


Out-of-Sample 


Hedging 


Effectiveness 


Standard Deviation 


Skewness 


Kurtosis 




Hedge Ratio 


Variance 


95% VaR HE 


Unhedged 


Hedged 


Unhedged 


Hedged 


Unhedged 


Hedged 


Original Data 


0.70 


0.56 


0.37 


0.50 


0.30 


0.00 


0.09 


4.83 


5.98 


Scale 1 


0.55 


0.35 


0.23 


0.40 


0.30 


0.08 


0.05 


3.93 


4.85 


Scale 2 


0.79 


0.69 


0.45 


0.30 


0.10 


0.03 


0.02 


4.17 


5.65 


Scale 3 


0.91 


0.88 


0.66 


0.20 


0.10 


0.01 


0.00 


3.72 


5.73 


Scale 4 


0.96 


0.96 


0.80 


0.10 


0.0 


-0.10 


0.17 


4.55 


5.51 


Scale 5 


0.96 


0.98 


0.87 


0.10 


0.0 


-0.12 


0.43 


3.48 


4.48 


Scale 6 


0.99 


0.98 


0.88 


0.10 


0.0 


-0.07 


-0.03 


2.89 


3.97 



Table 5: Statistics for GBP/EUR Exchange Rate unhedged and hedged portfolios at different scales 

Notes: Shown are the in- and out-of-sample hedge ratio, hedging effectiveness, standard deviation (in %), skewness and kurtosis. 
unhedged and minimum-variance hedged portfolio at different scales. 



,, averaged over each moving window for the 



